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Learning Outcomes 
Introduction 


Practice 
e Icebreaker, Importing and Exporting 
Data Cleaning 
Analysis 
Visualisation 


Recap + Feedback Form 


A AfterWork 


EE LEARNING OUTCOMES 


e |can perform data cleaning techniques to prepare data for further 


analysis using Excel. 
e |can perform data analysis techniques to answer business problems 


using Excel. 
e |can perform data visualisation techniques to communicate insights 


to stakeholders using Excel. 
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E WHAT ARE SPREADSHEETS? 


e Definition 
o Computer programs that capture, display and manipulate data 
arranged in rows and columns. 
o Used to create reports and dashboards which can be comprised of 
summary tables, visualisations, descriptions, etc. 
o Most familiar use is data presentation. 
o Formulas are useful for computing sales, budgets, and other 
numeric summaries. 
o Convenient for analysis of structured data and data visualisation. 
e Concepts 
o Cells, Sheets, Formulas, Functions, Charts, Pivot tools, 
What-if-Analysis etc. 
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HE SPREADSHEET APPLICATIONS 


Microsoft Excel by Microsoft (Proprietary) 
Google Sheets by Google (Free) 
Libreoffice Calc (Free and Open Source) 
Lotus 123 by IBM 
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BM WHY SPREADSHEETS? 


e One of the easiest structured methods to storing and 
manipulating data. 

e Ability to perform statistical calculations. 

e Ability to perform data analysis and visualisation 

e Best tool to get the job done i.e. producing reports is an easy 
task. 
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ΠΝ PROJECTS 


e Microsoft Excel is a very versatile tool and can be used for almost 
anything that you can imagine: 


Agendas, Budgets, Calendars, Cards, Charts and diagrams, Financial tools 


(loan calculators, etc.), Flyers, Forms, Inventories, Invoices, Lists and to-do 
checklists, Planners, Plans and proposals, Reports, Schedules, Timesheets 
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a ICE BREAKER 


Come up with as many questions as possible that can be 
answered from the clean titanic dataset. 
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ΜΝ PRACTICE: IMPORTING 8 EXPORTING 


Concepts 
e Importing data 
e Exporting data 


Exercise 
e Addanewcolumn to the left of temperature column, Fahrenheit. 
e Use formula to compute values from temperature column. 
o Note: Fahrenheit =1.8C + 32 
e Export the data as a CSV file. 


Dataset 
ο https://bit.ly/CitiesDataset| 
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ΠΠ PRACTICE: DATA CLEANING 


Concepts 
e Data Cleaning Techniques 


Exercise 
e Perform the data cleaning techniques outlined in the given guidelines 
document and save the file for further analysis. 


e Data Cleaning guidelines: https://bit.ly/DataCleaningGuidelines 


Dataset 
e https://bit.ly/DirtyTitanicDataset 
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Ma PRACTICE: DATA ANALYSIS 


Concepts 
e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / 
aggregation / analysis 
Exercise 
e How many cities in Italy? 
e What is the average latitude: 


o Overall? 
o For cities with temperature < 10 ? 
o For cities with temperature > 10 ? 
o For cities where both the city name and the country name end in the 
letter “a” ? 
o Which are warmer on average - cities in the EU or cities not in the EU? 
Dataset 


ο = https://bit.ly/CitiesDatasetl and https://bit.ly/CountriesDataset| 
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EEE PRACTICE: DATA ANALYSIS (TITANIC) 


Concepts 

e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / 

Aggregation / Analysis. 

Exercise 

e How many passengers sailed for free? 
How many married women over age 50 embarked in Cherbourg? 
How many passenger ages are missing? 
What is the average fare paid by these passengers? 
What is the most common last name among passengers, and how 
many passengers have that last name? What is the average number 
of passengers per last name? 
What is the average fare paid by passengers in the three classes? 

e What is the average age of passengers in the three classes? 
Dataset 

e Clean Titanic Dataset 
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ΠΝ PRACTICE: DATA VISUALISATION 


e Effective Data visualisation 
o Hans Rosling's 200 Countries, 200 Years, 4 Minutes: 
m https:/www.youtube.com/watch?v=jbkSRLYSojo 
Suggested charts are often good ones. 
For help while working with charts i.e. Web search or fiddling 
Don’t underestimate the power of basic visualizations 
Bar charts, Pie charts, Scatterplots and Sparklines 
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a BAR CHARTS 


e Concept 
o Bar charts: When one axis is categories and the other is numeric. 


e Exercise 
o Ten countries with the highest population, bar chart showing 


populations. 


e Dataset 
ο =©https://bit.ly/CountriesDatasetl 
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a PIE CHARTS 


e Concept 
o Pie charts: To compare relative sizes of categories. 


e Exercise 
o Pie chart showing relative number of cities with negative longitude 


and positive longitude. 


e Dataset 
o https://bit.ly/CitiesDatasetl 
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ΠΝ SCATTER PLOTS 


e Concept 
o Scatter plots: When both axes are numeric. 


e Exercise 
o Useascatter plot to explore whether there is any relationship 
between the latitude of cities in a country (x-axis) and the population 
of that country (y-axis). 


e Dataset 
o =©https://bit.ly/CitiesDatasetl and https://bit.ly/CountriesDataset| 
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ΠΠ PRACTICE: DATA VISUALISATION (TITANIC) 


Concepts 

e Data visualisation Techniques: Bar Charts, Pie Charts and Scatter plot 
Exercise 

e How many passengers sailed for free? 
How many married women over age 50 embarked in Cherbourg? 
How many passenger ages are missing? 
What is the average fare paid by these passengers? 
What is the most common last name among passengers, and how 
many passengers have that last name? What is the average number 
of passengers per last name? 
What is the average fare paid by passengers in the three classes? 

e What is the average age of passengers in the three classes? 
Dataset 

e Clean Titanic Dataset 
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ΠΝ DELIVERABLE 
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ΠΝ LIMITATIONS 


Automated data cleaning procedures for lots of data can be tedious. 
Manual process for analysis 

Hard to consolidate many data sources through automation 

Lack of version control 
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ΠΝ DEBRIEF+Q&A 


e |can perform data cleaning techniques to prepare data 
for further analysis using Excel. 

e |can perform data analysis techniques to answer 
business problems using Excel. 

e |can perform data visualisation techniques to 
communicate insights to stakeholders using Excel. 
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ΠΝ CHECK OUT FORM 


https://bit.ly/ExcelBasicsforDataScience 


GO FORTH AND PROSPER A AfterWork 


